The experimental violation of Bell's inequality establishes necessary but not sufficient conditions
that any theory must obey. Namely, a theory compatible with the experimental observations can
satisfy at most two of the three hypotheses at the basis of Bell's theorem: free will, no-signaling,
and outcome-independence. Quantum mechanics satisfies the first two hypotheses but not the
last. Experiments not only violate Bell's inequality, but also show an excellent agreement with
quantum mechanics. This fact further restricts the class of admissible theories. In this work, the
author determines the form of the hidden-variable models that reproduce the quantum mechanical
predictions for a spin singlet while satisfying both the hypotheses of free will and no-signaling.
Two classes of hidden-variable models are given as examples, and a general recipe to build
infinitely many possible models is provided.



I. INTRODUCTION 

Are there theories more fundamental than quantum 
mechanics? Since the groundbreaking work of Bell [1],
this question has garnered increasing attention. These 
purported more fundamental theories are known as 
hidden-variable models, since they rely on the existence 
of parameters, the hidden variables, that are distinct 
from the wave function and from the classical observables
(such as energy, position, etc.).

A considerable result was achieved by Bell and perfected
by others [2, 3], who showed that a whole family
of such theories could be experimentally tested even
though no explicit hypothesis about their mathemati- 
cal structure nor about the additional parameters was 
made, by requiring, instead, that the models lead to 
probabilistic predictions satisfying some "reasonable" as- 
sumptions. These assumptions, discussed at length be- 
low, are known as Measurement-Independence, Setting- 
Independence, and Outcome-Independence. The first 
one may be justified invoking both [4] the impossibility 
of action-at-a-distance and the independence of the mea- 
surement settings from any variables ("free will"); the 
second one is a consequence of the impossibility of su- 
perluminal signaling; the third assumption, however, is 
more difficult to justify [5, 6]. More recently, Leggett [7]
demonstrated the incompatibility of quantum mechan- 
ics with all models satisfying Measurement-Independence 
and a stronger form of Setting-Independence, the com- 
pliance with Malus's law. 

Experiments [8, 9] not only show a clear violation of 
both Bell and Leggett inequalities, but they reproduce 
accurately the predictions of quantum mechanics, since 
the discrepancies can be explained by the unavoidable 
imperfections of preparation and measurement. Thus, 
the constraints put on hidden variable models by the 
current experimental evidence are even stricter than the 
simple incompatibility with at least one of the three hy- 
potheses of Bell (or of the two hypotheses of Leggett): 
after averaging over the hidden variables the predictions 
of quantum mechanics must be reproduced, in order for 
the theory to be admissible. 



In the present paper, we consider models satis- 
fying both Measurement-Independence and Setting- 
Independence, so that the principles of "free will" and 
no-signaling are satisfied. By building upon a recent 
theorem [10, 11], we shape the form of all such hidden- 
variable theories that are compatible with quantum me- 
chanics and hence with experiments. 

II. THE SYSTEM AND THE GOAL 

The system of interest is a pair of particles in a spin- 
singlet configuration which fly to space-separated loca- 
tions. We use the language of spin, rather than polariza- 
tion of light, since the formulas are slightly more com- 
pact. The events consist in the determination of the spin 
projection along a given axis for each particle. We choose 
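units such that the outcomes for each particle are $\sigma, \tau \in \{-1, 1\}$.
The measured observables are the spin projections $\mathbf{a}\cdot\mathbf{S}_1$ and
$\mathbf{b}\cdot\mathbf{S}_2$, which we shall indicate simply by $\mathbf{a}$ and
$\mathbf{b}$. For brevity, we write the conditional probability of observing the
outcome $\{\sigma, \tau\}$ for given values of $\mathbf{a}$, $\mathbf{b}$ and hidden
variables $\lambda$ as $P(\sigma, \tau|\lambda, \mathbf{a}, \mathbf{b}) = P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b})$.
Quantum mechanics predicts that

$$P^{\mathrm{QM}}_{\sigma,\tau}(\psi, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left[1 - \sigma\tau\,\mathbf{a}\cdot\mathbf{b}\right], \tag{1}$$

where $\psi$ describes the preparation of two particles in a singlet state. Our goal
is to determine a positive measure $d\mu$ and a conditional probability
$P_{\sigma,\tau}$ such that integration over the hidden variables yields
$P^{\mathrm{QM}}$, namely

$$\int d\mu(\lambda|\mathbf{a}, \mathbf{b})\, P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left[1 - \sigma\tau\,\mathbf{a}\cdot\mathbf{b}\right]. \tag{2}$$

The preparation $\psi$ is henceforth omitted, and it is understood
that it appears as a prior in all the probabilities.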
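The quantum prediction of Eq. (1) can be tabulated directly. The following is a minimal Python sketch, not part of the original derivation; the helper names `unit`, `p_qm` and `qm_table` and the chosen directions are illustrative assumptions. It checks that the four probabilities are normalized, that the local marginal is 1/2, and that the spin-spin correlation equals $-\mathbf{a}\cdot\mathbf{b}$.

```python
import numpy as np

def unit(v):
    """Return v normalized to unit length."""
    v = np.asarray(v, dtype=float)
    return v / np.linalg.norm(v)

def p_qm(sigma, tau, a, b):
    """Singlet joint probability of Eq. (1): (1/4)[1 - sigma*tau*(a.b)]."""
    return 0.25 * (1.0 - sigma * tau * np.dot(a, b))

def qm_table(a, b):
    """All four joint probabilities, keyed by (sigma, tau)."""
    return {(s, t): p_qm(s, t, a, b) for s in (-1, 1) for t in (-1, 1)}

# Illustrative (hypothetical) measurement directions.
a = unit([0.0, 0.0, 1.0])
b = unit([1.0, 0.0, 1.0])
table = qm_table(a, b)

total = sum(table.values())                                   # normalization
marginal_a = table[(1, 1)] + table[(1, -1)]                   # P(sigma = +1)
correlation = sum(s * t * p for (s, t), p in table.items())   # <sigma tau>
```

The marginal does not depend on `b`, which is the no-signaling property of the quantum prediction itself.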

III. HYPOTHESES AT THE BASIS OF BELL 
AND LEGGETT INEQUALITIES 

The models excluded by Bell inequality rely on three 
hypotheses: Measurement-Independence (which we refer 
to as Uncorrelated Choice), Outcome-Independence (for
which we propose the more descriptive term "Reducibil- 
ity of Correlations"), and Setting-Independence. The 
models excluded by Leggett inequality rely on Uncorrelated
Choice and compliance with Malus's law. As one
or more assumptions must be violated, we shall briefly
discuss the physical meaning of these assumptions, in order
to identify the least problematic hypotheses to
drop. In the following, we shall use the word "locality"
to mean simply the impossibility of superluminal
signaling.

Uncorrelated Choice (UC), sometimes called 
Measurement-Independence, means that the distribution
of the hidden variables $\lambda$ and the settings of the detectors are
uncorrelated. If one thinks of $\lambda$ as a set of parameters
attached to the physical system, then Uncorrelated
Choice follows from locality. However, it may happen
that $\lambda$ is correlated with the choice of the observables
to be detected, due to some past common cause [12], 
and in this case Uncorrelated Choice can be violated 
even though locality holds [4, 12-14]. Indeed, if one 
considers that the choice of settings, be it done by an 
automatic random mechanism or by a conscious being, 
can be influenced by events in the past light-cone of 
either station A or B, and considers further that these 
light-cones have an intersection between themselves and 
with the past light-cone of the entangling apparatus, 
it is possible, in principle, that there are correlations 
between the hidden variables and the choice of settings. 
Given our current knowledge, however, this is a remote
possibility. Usually, it implies a limitation of free will,
or a conspiracy of sorts, but there is a possibility that
what appears to be a conspiracy today is but a manifestation
of some fundamental law.

Reducibility of Correlations (RC), known also as
Outcome-Independence, means that the conditional
probability of the outcome $\tau$, given $\lambda$ and given that
the outcome of the measurement of $\mathbf{a}$ is $\sigma$, does not
depend on the latter, namely
$P_{\tau}(\lambda, \mathbf{a}, \mathbf{b}, \sigma) = P_{\tau}(\lambda, \mathbf{a}, \mathbf{b})$,
so that the joint probability is
$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = P_{\sigma}(\lambda, \mathbf{a}, \mathbf{b})\,P_{\tau}(\lambda, \mathbf{a}, \mathbf{b})$.
Hence, if the parameters $\lambda$ could
be accessed, by either measuring them or fixing them,
there would be no correlations. After averaging over $\lambda$,
however, correlations appear. Thus Reducibility of Correlations
means that the quantum correlations emerge
from the ignorance of some more fundamental parameters.
In order to check whether Reducibility of Correlations
holds, the observer at A must calculate the conditional
probability $P_{\sigma}(\tau, \mathbf{a}, \mathbf{b}, \lambda)$ (still assuming that $\lambda$
can be accessed by A), and check whether it varies when
$\tau$ varies, while the other parameters are fixed. In order
to do so, A must have access to the remote information
$\mathbf{b}, \tau$, which B can send only at a speed not exceeding the
speed of light. Hence, violating Reducibility of Correlations
does not imply action-at-a-distance, nor the possibility
of instantaneous communication.

Setting-Independence (SI) means that the marginal
probability of observing the event $\sigma$ at A, for a given $\lambda$,
does not depend on the setting $\mathbf{b}$, namely
$P_{\sigma}(\lambda, \mathbf{a}, \mathbf{b}) = P_{\sigma}(\lambda, \mathbf{a})$.
It may seem that the violation of Setting-Independence
gives the possibility of instantaneous signaling,
so that locality implies Setting-Independence.$^1$
However, this is true only if $\lambda$ has a fixed known value,
or if it can be completely determined by a measurement
at location A (or B).
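The two conditions just defined can be stated operationally for a table of joint probabilities at fixed $\lambda$. The sketch below is only an illustration in Python; the function names and the toy tables are hypothetical and do not come from the paper. It builds tables with trivial marginals and a nonzero correlation, which satisfy Setting-Independence but violate Reducibility of Correlations.

```python
def marginal(table, index, value):
    """Marginal probability that outcome `index` (0 for A, 1 for B) equals `value`."""
    return sum(p for key, p in table.items() if key[index] == value)

def satisfies_setting_independence(tables_by_b, tol=1e-12):
    """SI: at fixed lambda and a, A's marginal is the same for every remote setting b."""
    reference = None
    for table in tables_by_b.values():
        current = [marginal(table, 0, s) for s in (-1, 1)]
        if reference is None:
            reference = current
        elif any(abs(c - r) > tol for c, r in zip(current, reference)):
            return False
    return True

def satisfies_reducibility(table, tol=1e-12):
    """RC: the joint probability factorizes into the product of its marginals."""
    return all(
        abs(p - marginal(table, 0, s) * marginal(table, 1, t)) <= tol
        for (s, t), p in table.items()
    )

def table_from_correlation(c):
    """Toy table with marginals 1/2 and correlation <sigma tau> = c."""
    return {(s, t): 0.25 * (1 + s * t * c) for s in (-1, 1) for t in (-1, 1)}

# Two hypothetical remote settings with different correlations at the same lambda.
tables = {"b0": table_from_correlation(-0.3), "b1": table_from_correlation(0.8)}
si_ok = satisfies_setting_independence(tables)
rc_ok = all(satisfies_reducibility(t) for t in tables.values())
```

Here `si_ok` is true while `rc_ok` is false, which is the combination of properties studied in the rest of the paper.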

Finally, compliance with Malus's law requires that
the hidden variables consist of a unit vector $\mathbf{u}$ such that
the marginal probability is
$P_{\sigma}(\mathbf{u}, \mathbf{a}, \mathbf{b}) = (1 + \sigma\,\mathbf{a}\cdot\mathbf{u})/2$.
Therefore, this hypothesis is a special case of Setting-Independence.
We remark that this hypothesis tries to
give a physical meaning to the hidden variables, assuming
that they are made of unit vectors, so
that each spin (or photon) possesses a well-defined polarization
and, if the polarization could be
fixed, the ordinary Malus's law would be obeyed.

For example, by relaxing the hypothesis of Uncorrelated Choice,
it is possible to violate both Bell and Leggett inequalities
[4, 12-14], a necessary condition to reproduce
on average the results of quantum mechanics. Other possibilities
explored in the literature consist in violating
both Uncorrelated Choice and Reducibility of Correlations
[15-17], or only Setting-Independence [18, 19].



IV. EXAMPLES 

Now, let us construct a family of models compatible 
with quantum mechanics. We consider only models obey- 
ing the hypotheses of Uncorrelated Choice and Setting- 
Independence, since the violation of either hypothesis 
may have controversial implications. A big help is pro- 
vided by the trivial-marginals theorem, derived (under 
assumptions slightly stronger than the strictly necessary 
ones) by Colbeck and Renner [10] and Branciard et al. 
[11], and rederived (under minimal assumptions) in Appendix A.
This theorem states that all hidden variable
models that satisfy Uncorrelated Choice and Setting-Independence
while reproducing the quantum mechanical
predictions must have a $\lambda$-conditioned probability
of the form

$$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left\{1 - \sigma\tau\left[\mathbf{a}\cdot\mathbf{b} - C(\lambda, \mathbf{a}, \mathbf{b})\right]\right\}, \tag{3}$$

where $C$ has a zero average with the weight $d\mu(\lambda)$ (which
represents the probability distribution of the $\lambda$), and
$C(\lambda, \mathbf{a}, \pm\mathbf{a}) = 0$ with the exclusion of the subsets of $\lambda$ where
$\mu(\lambda)$ is identically zero. In particular, all models satisfying
Malus's law are excluded by the theorem. In other
words, assuming Uncorrelated Choice and Malus's law
(which is a special case of Setting-Independence) results
in theories incompatible with quantum mechanics. The
function $C(\lambda, \mathbf{a}, \mathbf{b})$ represents the excess or defect of
correlations (with respect to the quantum mechanical
correlations) attributable to the hidden variables. Indeed,
if $\lambda$ could be fixed, either by the specification of a suitable
preparation procedure or by post-selection provided
a prescription for its measurement is given, then the observed
spin-spin correlations for detectors oriented along
$\mathbf{a}, \mathbf{b}$ would be
$\mathrm{Corr}(\lambda, \mathbf{a}, \mathbf{b}) = -\mathbf{a}\cdot\mathbf{b} + C(\lambda, \mathbf{a}, \mathbf{b})$.
As $C$ must vanish on average, it can take both positive and
negative values for different $\lambda$, thus the correlations, at
the hidden variable level, may be stronger than the quantum
mechanical correlations.

1 Some authors, indeed, identify Setting-Independence with
no-signaling, the impossibility of instantaneous communication,
while they reserve the term "locality" sometimes to mean
Reducibility of Correlations, other times to mean both Reducibility
of Correlations and Setting-Independence, and other times still
to refer to the three hypotheses Uncorrelated Choice, Reducibility
of Correlations, and Setting-Independence.

No explicit examples of models satisfying the trivial-marginals
theorem have been given so far, and we proceed
to fill this gap. First, we notice that not every choice of
$C$ leads to a non-negative probability. For instance,
choosing

$$C = \sqrt{1 - (\mathbf{a}\cdot\mathbf{b})^2}\; G(\lambda) \tag{4}$$

leads to regions of negative probabilities for any function
$G$ having zero average. An important result discussed in
the next section will be to provide a recipe for building
up all admissible functions $C$. As an example, we choose

$$C(\lambda, \mathbf{a}, \mathbf{b}) = \left[1 - (\mathbf{a}\cdot\mathbf{b})^2\right] G(\lambda), \tag{5}$$

with $|G(\lambda)| \le 1/2$ and $\int d\lambda\, \mu(\lambda) G(\lambda) = 0$. It is easy to
check that the probability in Eq. (3) is never negative
and that upon averaging over $\lambda$ Eq. (2) is satisfied. Thus
we have constructed a family of hidden-variable theories
that reproduce quantum mechanics. Notice that neither
the hypothesis of Reducibility of Correlations, needed in
order to derive Bell inequality, nor the hypothesis of compliance
with Malus's law, needed for Leggett inequality,
is satisfied.
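Both claims can be checked numerically on a grid of $x = \mathbf{a}\cdot\mathbf{b}$. The Python sketch below is an illustration under stated assumptions: the hidden variable is taken to be discrete with two equally likely values, and $G$ takes the values $\pm 0.4$ (a hypothetical choice with zero average and $|G| \le 1/2$). It evaluates Eq. (3) with the correlation function $[1 - x^2]\,G$ of Eq. (5) and with the square-root form of Eq. (4).

```python
import numpy as np

def p_joint(sigma, tau, x, C):
    """Eq. (3) with x = a.b: (1/4){1 - sigma*tau*[x - C]}."""
    return 0.25 * (1.0 - sigma * tau * (x - C))

# Hypothetical discrete hidden variable: two equally likely values of lambda.
weights = np.array([0.5, 0.5])
G_vals = np.array([0.4, -0.4])      # zero weighted average, |G| <= 1/2

xs = np.linspace(-1.0, 1.0, 2001)   # grid of a.b

def C_eq5(x, G):
    """Admissible choice of Eq. (5)."""
    return (1.0 - x**2) * G

def C_eq4(x, G):
    """Inadmissible choice of Eq. (4)."""
    return np.sqrt(1.0 - x**2) * G

def min_probability(C_func):
    """Smallest joint probability over the grid, the outcomes and lambda."""
    return min(
        p_joint(s, t, xs, C_func(xs, G)).min()
        for G in G_vals for s in (-1, 1) for t in (-1, 1)
    )

min_eq5 = min_probability(C_eq5)    # stays non-negative
min_eq4 = min_probability(C_eq4)    # dips below zero near a.b = +-1

# Averaging Eq. (3) over lambda must give back Eq. (1), here for sigma = tau = +1.
avg = sum(w * p_joint(1, 1, xs, C_eq5(xs, G)) for w, G in zip(weights, G_vals))
max_dev = np.abs(avg - 0.25 * (1.0 - xs)).max()
```

The negative values for Eq. (4) appear close to $x = \pm 1$, where the square root vanishes more slowly than the quantum term $1 \mp x$.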

Another family of local models, i.e., models requiring and
allowing no instantaneous communication between the two
wings, is obtained by choosing

$$C(\lambda, \mathbf{u}, \mathbf{a}, \mathbf{b}) = \mathbf{a}\cdot\mathbf{b}\left[(\mathbf{a}\cdot\mathbf{u})^2 - (\mathbf{b}\cdot\mathbf{u})^2\right]^2 G(\lambda), \tag{6}$$

with, as before, $|G(\lambda)| \le 1/2$ and $G$ having zero average,
while $\mathbf{u}$ is a unit-vector hidden variable.
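A quick numerical check of this second family is sketched below in Python. It assumes the correlation function $C = \mathbf{a}\cdot\mathbf{b}\,[(\mathbf{a}\cdot\mathbf{u})^2 - (\mathbf{b}\cdot\mathbf{u})^2]^2\,G$ with the extreme values $G = \pm 1/2$, and samples $\mathbf{a}$, $\mathbf{b}$, $\mathbf{u}$ uniformly on the sphere; the sampling distribution and the helper names are assumptions made for illustration only. The smallest joint probability of the form (3) found in the sample should not be negative.

```python
import numpy as np

rng = np.random.default_rng(0)

def random_unit_vectors(n):
    """n unit vectors drawn uniformly on the sphere."""
    v = rng.normal(size=(n, 3))
    return v / np.linalg.norm(v, axis=1, keepdims=True)

def dot(p, q):
    """Row-wise scalar product."""
    return np.sum(p * q, axis=1)

def C_family6(a, b, u, G):
    """C = a.b [(a.u)^2 - (b.u)^2]^2 G, evaluated row by row."""
    return dot(a, b) * (dot(a, u) ** 2 - dot(b, u) ** 2) ** 2 * G

n = 100_000
a, b, u = (random_unit_vectors(n) for _ in range(3))
x = dot(a, b)

worst = np.inf
for G in (0.5, -0.5):                  # extreme admissible values of G
    C = C_family6(a, b, u, G)
    for st in (1, -1):                 # st = sigma * tau
        worst = min(worst, (0.25 * (1.0 - st * (x - C))).min())
```

Since the bracket in `C_family6` is bounded by $\sqrt{1 - x^2}$ in absolute value, `worst` is expected to remain non-negative up to rounding.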

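It can be shown that the model of Cerf et al. [15] can
be reduced to the form

$$\mu(\mathbf{u}, \mathbf{v}|\mathbf{a}, \mathbf{b}) = \frac{1}{(4\pi)^2}, \tag{7}$$

$$P_{\sigma,\tau}(\mathbf{u}, \mathbf{v}, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left\{1 - \sigma\tau\,\mathrm{sgn}(\mathbf{u}\cdot\mathbf{a})\,\mathrm{sgn}(\mathbf{n}_+\cdot\mathbf{b})\,\frac{1}{2}\left[1 + \gamma_{\mathbf{a},\mathbf{u},\mathbf{v}} + \gamma_{\mathbf{b},\mathbf{u},\mathbf{v}} - \gamma_{\mathbf{a},\mathbf{u},\mathbf{v}}\,\gamma_{\mathbf{b},\mathbf{u},\mathbf{v}}\right]\right\}, \tag{8}$$

where $\gamma_{\mathbf{a},\mathbf{u},\mathbf{v}} = \mathrm{sgn}(\mathbf{u}\cdot\mathbf{a})\,\mathrm{sgn}(\mathbf{v}\cdot\mathbf{a})$,
$\gamma_{\mathbf{b},\mathbf{u},\mathbf{v}} = \mathrm{sgn}(\mathbf{n}_+\cdot\mathbf{b})\,\mathrm{sgn}(\mathbf{n}_-\cdot\mathbf{b})$,
and $\mathbf{n}_\pm = \mathbf{u} \pm \mathbf{v}$. The reader
can verify that this model reproduces the quantum mechanical
predictions while it satisfies UC and SI, but violates
RC and thus falls within the family of models we
are interested in.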
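The verification left to the reader can be approximated by Monte Carlo sampling. The Python sketch below rests on two assumptions that should be checked against Ref. [15]: that $\mathbf{u}$ and $\mathbf{v}$ are independent and uniform on the unit sphere, and that the conditional correlation at fixed $(\mathbf{u}, \mathbf{v})$ is $-\mathrm{sgn}(\mathbf{u}\cdot\mathbf{a})\,\mathrm{sgn}(\mathbf{n}_+\cdot\mathbf{b})\,(1 + \gamma_a + \gamma_b - \gamma_a\gamma_b)/2$ with $\gamma_a = \mathrm{sgn}(\mathbf{u}\cdot\mathbf{a})\,\mathrm{sgn}(\mathbf{v}\cdot\mathbf{a})$ and $\gamma_b = \mathrm{sgn}(\mathbf{n}_+\cdot\mathbf{b})\,\mathrm{sgn}(\mathbf{n}_-\cdot\mathbf{b})$. The function name and the test directions are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def random_unit_vectors(n):
    """n unit vectors drawn uniformly on the sphere."""
    v = rng.normal(size=(n, 3))
    return v / np.linalg.norm(v, axis=1, keepdims=True)

def cerf_correlation(a, b, n_samples=200_000):
    """Monte Carlo average over (u, v) of the conditional correlation <sigma tau>."""
    u = random_unit_vectors(n_samples)
    v = random_unit_vectors(n_samples)
    n_plus, n_minus = u + v, u - v
    gamma_a = np.sign(u @ a) * np.sign(v @ a)
    gamma_b = np.sign(n_plus @ b) * np.sign(n_minus @ b)
    # The bracket equals -1 when gamma_a = gamma_b = -1 and +1 otherwise.
    box = 0.5 * (1.0 + gamma_a + gamma_b - gamma_a * gamma_b)
    corr_given_uv = -np.sign(u @ a) * np.sign(n_plus @ b) * box
    return corr_given_uv.mean()

a = np.array([0.0, 0.0, 1.0])
b = np.array([1.0, 0.0, 1.0]) / np.sqrt(2.0)
estimate = cerf_correlation(a, b)   # should approach -a.b
```

With 200 000 samples the statistical error is of order $10^{-3}$, so the estimate should agree with $-\mathbf{a}\cdot\mathbf{b}$ to about two decimal places.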



V. MAIN THEOREM 

While the examples above were found by trial and er- 
ror, a careful analysis of the presence or lack of negative 
regions for the probabilities leads to the main result of 
the present paper. 

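Theorem. The function $C$ in Eq. (3) is of the form

$$C(\lambda, \mathbf{a}, \mathbf{b}) = \left[1 + \mathbf{a}\cdot\mathbf{b}\right]^{s_+}\left[1 - \mathbf{a}\cdot\mathbf{b}\right]^{s_-} G(\lambda, \mathbf{a}, \mathbf{b}), \tag{9}$$

with

$$0 < |G(\lambda, \mathbf{a}, \pm\mathbf{a})| < \infty, \quad \text{for } \lambda \in D_\pm,\ \mu(D_\pm) > 0, \tag{10a}$$

$$\int d\mu(\lambda)\, G(\lambda, \mathbf{a}, \mathbf{b}) = 0, \tag{10b}$$

$$\frac{-1}{\left[1 - \mathbf{a}\cdot\mathbf{b}\right]^{s_- - 1}\left[1 + \mathbf{a}\cdot\mathbf{b}\right]^{s_+}} \le G(\lambda, \mathbf{a}, \mathbf{b}) \le \frac{1}{\left[1 - \mathbf{a}\cdot\mathbf{b}\right]^{s_-}\left[1 + \mathbf{a}\cdot\mathbf{b}\right]^{s_+ - 1}}, \tag{10c}$$

$$s_+ \ge 1, \quad s_- \ge 1, \tag{10d}$$

$$|G(\lambda, \mathbf{a}, \pm\mathbf{a})| \le 1/2^{s_\pm} \quad \text{if } s_\mp = 1. \tag{10e}$$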

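Proof. In order to satisfy $C(\lambda, \mathbf{a}, \pm\mathbf{a}) = 0$, the function
$C$ must be of the form

$$C(\lambda, \mathbf{a}, \mathbf{b}) = \left[1 + \mathbf{a}\cdot\mathbf{b}\right]^{s_+}\left[1 - \mathbf{a}\cdot\mathbf{b}\right]^{s_-} G(\lambda, \mathbf{a}, \mathbf{b}), \tag{11}$$

with $s_+ > 0$, $s_- > 0$ and $0 < |G(\lambda, \mathbf{a}, \pm\mathbf{a})| < \infty$. This
is a Frobenius-like expansion, with $s_\pm$ determining how
fast the function vanishes for $\mathbf{a}\cdot\mathbf{b} = \mp 1$, so that Eq. (10a)
follows by definition, with $D_\pm$ a domain of $\lambda$ having
non-zero measure (if $G$ is identically zero almost everywhere
when $\mathbf{a}\cdot\mathbf{b} = \pm 1$ we can then redefine $s_\pm$ and
$G$). Equation (10b) follows from $\int d\mu(\lambda)\, C(\lambda, \mathbf{a}, \mathbf{b}) = 0$.
The positivity of the probability implies the inequalities
in Eq. (10c). These inequalities also guarantee that
none of the four probabilities exceeds one, since one can
readily verify the more precise inequality
$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) \le 1/2$, $\forall \sigma, \tau, \lambda, \mathbf{a}, \mathbf{b}$.
Furthermore, if we let $\mathbf{a}\cdot\mathbf{b} = \pm(1 - \varepsilon)$,
we have that the probability of $\{\sigma, \pm\sigma\}$ is, to lowest order,

$$P_{\sigma,\pm\sigma} \simeq \frac{\varepsilon}{4}\left[1 \pm 2^{s_\pm}\,\varepsilon^{s_\mp - 1}\, G(\lambda, \mathbf{a}, \pm\mathbf{a})\right]; \tag{12}$$

therefore, remembering that $G(\lambda, \mathbf{a}, \pm\mathbf{a})$ changes sign
when varying $\lambda$, in order for the probability to be positive
we must have $s_\mp \ge 1$, proving Eq. (10d). Finally,
assuming that $s_\mp = 1$, Eq. (12) implies that
$|G(\lambda, \mathbf{a}, \pm\mathbf{a})| \le 1/2^{s_\pm}$, so that Eq. (10e) is
proved. □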
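For completeness, the step from positivity to the two-sided bound on $G$ can be written out explicitly. The following is a short worked derivation, with the shorthand $x = \mathbf{a}\cdot\mathbf{b}$ (introduced here only for brevity) and $|x| < 1$, obtained by inserting the factorized form of $C$ into the joint probability of Eq. (3):

```latex
\begin{align*}
P_{\sigma,\sigma} &= \tfrac{1}{4}\left[\,1 - x + (1+x)^{s_+}(1-x)^{s_-}\,G\,\right] \ge 0
  &&\Longrightarrow\quad G \ge \frac{-1}{(1-x)^{s_- - 1}(1+x)^{s_+}},\\[4pt]
P_{\sigma,-\sigma} &= \tfrac{1}{4}\left[\,1 + x - (1+x)^{s_+}(1-x)^{s_-}\,G\,\right] \ge 0
  &&\Longrightarrow\quad G \le \frac{1}{(1-x)^{s_-}(1+x)^{s_+ - 1}}.
\end{align*}
```

Since $P_{\sigma,\sigma} + P_{\sigma,-\sigma} = 1/2$ for every $\lambda$, non-negativity of both terms also gives the upper bound $P_{\sigma,\tau} \le 1/2$ quoted in the proof.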
The two families described by Eqs. (5) and (6) have
$s_+ = s_- = 1$. There may be pathological cases which
are not captured by our theorem, e.g., it may happen
that $G(\lambda, \mathbf{a}, \mathbf{b})$ averages to zero for all $\mathbf{b} \neq \pm\mathbf{a}$, but that
$G(\lambda, \mathbf{a}, \pm\mathbf{a})$ has a constant sign for varying $\lambda$, so that
Eqs. (10d) and (10e) may not be satisfied. This behavior
requires some essential non-analyticity, and we believe it
is not physically interesting.

Let us provide a constructive recipe to build families
of hidden-variable models. We choose $s_+ = s_- = s$
just for the sake of symmetry. Now we pick an arbitrary
bounded function $f(\lambda, \mathbf{a}, \mathbf{b})$ having a finite value
for $\int d\mu(\lambda) f(\lambda, \mathbf{a}, \mathbf{b})$, where $\lambda$ may include vectors,
scalars, and discrete variables (in which case the integral is
a sum). We build the zero-average function
$g(\lambda, \mathbf{a}, \mathbf{b}) = f(\lambda, \mathbf{a}, \mathbf{b}) - \int d\mu(\lambda') f(\lambda', \mathbf{a}, \mathbf{b})$.
We consider the supremum and infimum of $g$, $M$ and $m$. By construction
$M \ge 0$ and $m \le 0$. If they satisfy

$$-\frac{(s - 1/2)^{2s-1}}{s^s (s-1)^{s-1}} \le m \le M \le \frac{(s - 1/2)^{2s-1}}{s^s (s-1)^{s-1}}, \tag{13}$$

then our job is done, since

$$\frac{-1}{(1 - \mathbf{a}\cdot\mathbf{b})^{s-1}(1 + \mathbf{a}\cdot\mathbf{b})^{s}} \le -\frac{(s - 1/2)^{2s-1}}{s^s (s-1)^{s-1}} \le \frac{(s - 1/2)^{2s-1}}{s^s (s-1)^{s-1}} \le \frac{1}{(1 - \mathbf{a}\cdot\mathbf{b})^{s}(1 + \mathbf{a}\cdot\mathbf{b})^{s-1}}. \tag{14}$$

Otherwise, we multiply $g$ by an appropriate factor,
so that Eq. (13) is satisfied. The resulting function
$C(\lambda, \mathbf{a}, \mathbf{b}) = \left[1 - (\mathbf{a}\cdot\mathbf{b})^2\right]^s g(\lambda, \mathbf{a}, \mathbf{b})$
satisfies all the hypotheses of the main theorem by construction, and we
have built a hidden variable model.
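The recipe can be turned into a few lines of Python. The sketch below is an illustration, not the author's implementation: it assumes a discrete hidden variable with three hypothetical values and weights, an arbitrary bounded $f$ that depends on the settings only through $x = \mathbf{a}\cdot\mathbf{b}$, and the bound $K(s) = (s - 1/2)^{2s-1}/[s^s (s-1)^{s-1}]$ with the convention $0^0 = 1$ at $s = 1$. It subtracts the average, rescales, multiplies by $[1 - x^2]^s$, and then checks positivity and the averaged prediction.

```python
import numpy as np

def bound(s):
    """K(s) = (s - 1/2)^(2s-1) / (s^s (s-1)^(s-1)), with 0^0 = 1 at s = 1."""
    tail = 1.0 if s == 1 else (s - 1.0) ** (s - 1.0)
    return (s - 0.5) ** (2 * s - 1) / (s ** s * tail)

def zero_mean_rescaled(f_vals, weights, s):
    """Subtract the mu-average of f at each setting, then shrink into [-K, K] if needed."""
    g = f_vals - weights @ f_vals      # the average over lambda depends on x only
    scale = max(g.max(), -g.min())     # max(M, -m)
    K = bound(s)
    return g * (K / scale) if scale > K else g

# Hypothetical discrete hidden variable and an arbitrary bounded f(lambda, x).
lam = np.array([0.0, 1.0, 2.5])
weights = np.array([0.2, 0.3, 0.5])    # mu(lambda_i), summing to one
xs = np.linspace(-1.0, 1.0, 401)       # grid of x = a.b
f_vals = 3.0 * np.cos(lam[:, None] + 2.0 * xs[None, :])

def check_model(s):
    """Return (smallest probability, largest deviation of the average from Eq. (1))."""
    g = zero_mean_rescaled(f_vals, weights, s)
    C = (1.0 - xs ** 2) ** s * g
    p_min, dev = np.inf, 0.0
    for st in (1, -1):                 # st = sigma * tau
        P = 0.25 * (1.0 - st * (xs - C))
        p_min = min(p_min, P.min())
        dev = max(dev, np.abs(weights @ P - 0.25 * (1.0 - st * xs)).max())
    return p_min, dev

p_min_1, dev_1 = check_model(1.0)
p_min_2, dev_2 = check_model(2.0)
```

Rescaling by a constant preserves the zero average, so the averaged probability coincides with the quantum one for any admissible $s$.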



VI. DISCUSSION 

There appears to be a contrast between the results 
presented in the paragraph above and those reported in 
two recent papers [19, 20]. We shall briefly discuss these 
contrasts. 

Reference [19] claims that "the assumed experimenter's
freedom to choose the settings ensures that the
setting information must be non-locally transferred even 
when the SI condition is obeyed" and concludes that the 
work "provides the general conditions that every non- 
local hidden variable theory has to satisfy in order to 
allow for violation of the CHSH inequality" . These con- 
clusions are evidently wrong, as we have provided mod- 
els obeying Setting Independence and not only violating 
the CHSH inequality, but reproducing the full quantum 
mechanical predictions. As shown in Appendix B, the 
conclusions of Ref. [19] are valid provided that they are 
restricted to models satisfying certain hypotheses. One 
of these hypotheses is that the conditional probability is 
not extracted from experimental data, but is simulated 
at location A according to some algorithm. Here, instead,
we are considering the possibility that, in addition
to the wave function, there exist further parameters $\lambda$
giving a finer description of the system. We agree with
Ref. [19] that, if the (allegedly) experimentally accessible
conditional probabilities were to be reproduced through
an algorithm, then both $\mathbf{b}$ and $\tau$ should be transmitted
to A.

On the other hand, the results presented here show
that quantum mechanics can be extended through the
specification of additional parameters $\lambda$, and that this extension
has improved predictive power, since the function
$C(\lambda, \mathbf{a}, \mathbf{b})$ is non-zero, and consequently the predicted
joint probability for given $\lambda$ differs from the quantum
mechanical one:

$$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left\{1 - \sigma\tau\left[\mathbf{a}\cdot\mathbf{b} - C(\lambda, \mathbf{a}, \mathbf{b})\right]\right\}. \tag{15}$$

This seems to contradict the findings of Ref. [20]. However,
in Ref. [20], the impossibility of an improved
predictive power refers to the marginal probability
$P_{\sigma}(\lambda, \mathbf{a})$, not to the joint one $P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b})$. The models
discussed in the present work predict marginal probabilities
$P_{\sigma}(\lambda, \mathbf{a}) = 1/2$ and hence do not contradict
Ref. [20]. In other words, the apparent tension is due to
the definition of 'extension of quantum theory'.



VII. CONCLUSIONS 

In conclusion, we have established the form of 
all the hidden variable models able to reproduce 
the quantum mechanics of a spin-singlet by satis- 
fying both the assumptions of "free will" and no- 
signaling, which correspond, respectively, to Uncorre- 
lated Choice (Measurement-Independence) and Setting- 
Independence. By contrast, we have assumed the vio- 
lation of Reducibility of Correlations, as this can never 
result in superluminal signaling, since it consists in the 
dependence of a conditional probability on a remote out- 
come, and as such it requires the communication of said 
outcome through means that are necessarily subluminal. 
Rather, the violation of Reducibility of Correlations im- 
plies that the quantum correlations cannot be attributed 
to the ignorance of the hidden parameters. 
Appendix A: The trivial-marginals theorem

We prove a theorem established in Ref. [10] assuming
that the hidden variables can be written
$\lambda = \lambda_L \cup \lambda_0 \cup \lambda_R$, with $\lambda_{L,R}$ local parameters associated to
the measurement at location $L, R$ admitting a factorable
measure, and in Ref. [11] for discrete variables only. The
proof below relies on none of these additional assumptions.

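Theorem. All hidden-variable theories that satisfy Uncorrelated
Choice and Setting-Independence, and that reproduce
the quantum mechanical predictions for spin singlets,
predict conditional probabilities of the form

$$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left\{1 - \sigma\tau\left[\mathbf{a}\cdot\mathbf{b} - C(\lambda, \mathbf{a}, \mathbf{b})\right]\right\}, \tag{A1}$$

where

$$\int d\mu(\lambda)\, C(\lambda, \mathbf{a}, \mathbf{b}) = 0, \tag{A2a}$$

$$C(\lambda, \mathbf{a}, \pm\mathbf{a}) = 0, \tag{A2b}$$

$$|\mathbf{a}\cdot\mathbf{b} - C(\lambda, \mathbf{a}, \mathbf{b})| \le 1, \tag{A2c}$$

with $\mu(\lambda)$ a measure.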



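Proof. Consider a hidden variable theory that tries to reproduce
the quantum mechanical predictions for a spin
singlet. It must satisfy Eq. (2), with $d\mu(\lambda|\mathbf{a}, \mathbf{b})$ a positive
measure. Generally $d\mu(\lambda|\mathbf{a}, \mathbf{b}) = \mu(\lambda|\mathbf{a}, \mathbf{b})\,d\lambda$, and
the positive normalized generalized function $\mu$ can be
interpreted as the probability density of $\lambda$ for given
$\mathbf{a}, \mathbf{b}$. Measurement-Independence implies that the measure
does not depend on the settings of the detectors, i.e.,
$d\mu(\lambda|\mathbf{a}, \mathbf{b}) = d\mu(\lambda)$ or $\mu(\lambda|\mathbf{a}, \mathbf{b}) = \mu(\lambda)$. Without loss of
generality, we put

$$P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \frac{1}{4}\left[1 - \sigma\tau\,\mathbf{a}\cdot\mathbf{b} + \Delta_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b})\right]. \tag{A3}$$

The function $\Delta_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b})$, by definition, satisfies

$$\int d\mu(\lambda)\, \Delta_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = 0, \tag{A4}$$

$$\sum_{\sigma,\tau} \Delta_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = 0, \tag{A5}$$

and it can be written as

$$\Delta_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \sigma A(\lambda, \mathbf{a}, \mathbf{b}) + \tau B(\lambda, \mathbf{a}, \mathbf{b}) + \sigma\tau\, C(\lambda, \mathbf{a}, \mathbf{b}), \tag{A6}$$

with all three functions satisfying Eq. (A4). In particular,
Eq. (A2a) is satisfied. Setting-Independence requires
that the marginal probability of observing the outcome $\sigma$
at detector $\mathbf{a}$ is not influenced by the direction $\mathbf{b}$ chosen
for the other detector, and vice versa, namely

$$P_{\sigma}(\lambda, \mathbf{a}, \mathbf{b}) = \sum_{\tau} P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = P_{\sigma}(\lambda, \mathbf{a}), \tag{A7}$$

$$P_{\tau}(\lambda, \mathbf{a}, \mathbf{b}) = \sum_{\sigma} P_{\sigma,\tau}(\lambda, \mathbf{a}, \mathbf{b}) = P_{\tau}(\lambda, \mathbf{b}). \tag{A8}$$

Thus, we have that

$$A(\lambda, \mathbf{a}, \mathbf{b}) = A(\lambda, \mathbf{a}), \quad B(\lambda, \mathbf{a}, \mathbf{b}) = B(\lambda, \mathbf{b}). \tag{A9}$$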



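In particular, quantum mechanics predicts perfect
(anti)correlations when $\mathbf{a} = -\mathbf{b}$ ($\mathbf{a} = \mathbf{b}$). This implies
that (here it is fundamental that the measure is independent
of $\mathbf{a}$ and $\mathbf{b}$)

$$A(\lambda, \mathbf{a}) + B(\lambda, \mathbf{a}) = A(\lambda, \mathbf{a}) - B(\lambda, -\mathbf{a}) = 0, \tag{A10}$$

$$C(\lambda, \mathbf{a}, \mathbf{a}) = C(\lambda, \mathbf{a}, -\mathbf{a}) = 0, \tag{A11}$$

identically almost everywhere$^2$ in $\lambda$ and in $\mathbf{a}$. Equation (A10)
is satisfied by $B(\lambda, \mathbf{a}) = -A(\lambda, \mathbf{a})$, with
$A(\lambda, -\mathbf{a}) = -A(\lambda, \mathbf{a})$ an odd function of its second argument.
Consider now values close to the perfect anti-correlation
point, $\mathbf{b} = (\mathbf{a} + \boldsymbol{\delta})/\sqrt{1 + \delta^2}$, with $\mathbf{a}\cdot\boldsymbol{\delta} = 0$
and $|\boldsymbol{\delta}| \ll 1$. To first order, the probability $P_{\sigma,\sigma}(\lambda, \mathbf{a}, \mathbf{b})$
reads

$$P_{\sigma,\sigma}(\lambda, \mathbf{a}, \mathbf{a} + \boldsymbol{\delta}) \simeq \frac{1}{4}\,\boldsymbol{\delta}\cdot\left[-\sigma\,\frac{\partial A(\lambda, \mathbf{n})}{\partial \mathbf{n}} + \frac{\partial C(\lambda, \mathbf{a}, \mathbf{n})}{\partial \mathbf{n}}\right]_{\mathbf{n} = \mathbf{a}}, \tag{A12}$$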

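where $\mathbf{n}$ is a generic placeholder for a unit vector. Clearly,
Eq. (A12) cannot be positive for all $\boldsymbol{\delta}$. If it is positive
for a value $\boldsymbol{\delta}_0$, it will be negative for $\boldsymbol{\delta} = -\boldsymbol{\delta}_0$. The only
possibility is that the term inside the brackets in (A12)
vanishes identically in $\lambda$ and $\mathbf{a}$ or that it is proportional to
$\mathbf{a}$. By changing the sign of $\sigma$, summing and subtracting,
we notice that the following two identities should hold:

$$\left.\frac{\partial A(\lambda, \mathbf{n})}{\partial \mathbf{n}}\right|_{\mathbf{n} = \mathbf{a}} = f(\lambda, \mathbf{a})\,\mathbf{a}, \tag{A13}$$

$$\left.\frac{\partial C(\lambda, \mathbf{a}, \mathbf{n})}{\partial \mathbf{n}}\right|_{\mathbf{n} = \mathbf{a}} = g(\lambda, \mathbf{a})\,\mathbf{a}. \tag{A14}$$

Since $A$ is an odd function and $\mathbf{a}$ a unit vector, Eq. (A13)
implies that $A(\lambda, \mathbf{n}) = 0$. Indeed, invariance requires that
the dependence on the argument can be only of the form
$A(\lambda, \mathbf{n}) = A(\lambda, \mathbf{n}\cdot\mathbf{p}_j)$, where $\mathbf{p}_j$ are vectors either fixed or
depending on the hidden variables (possibly being some
of the hidden variables). We have then that

$$\frac{\partial}{\partial \mathbf{a}} A(\lambda, \mathbf{a}) = \sum_j \mathbf{p}_j \left.\frac{\partial A(\lambda, x_j)}{\partial x_j}\right|_{x_j = \mathbf{a}\cdot\mathbf{p}_j}. \tag{A15}$$

This implies that

$$\frac{\partial A(\lambda, x_j)}{\partial x_j} = 0, \tag{A16}$$

and hence $A(\lambda, x_j) = f(\lambda)$. Since $A(\lambda, -x_j) = -A(\lambda, x_j)$,
it follows that $A(\lambda, x_j) = 0$. Then $B = -A = 0$ as well, so that
Eq. (A3) reduces to Eq. (A1), whose validity
is thus proved. Then Eqs. (A2b) and (A2c) follow from the
positive-definiteness of the probability, the former being
implied by the latter and by Eq. (A2a). □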



2 'Almost everywhere' means in all subsets having non-zero measure.
If $\lambda$ has a discrete distribution, so that $\mu(\lambda)$ is a sum of
$\delta$-functions, 'almost everywhere' means, paradoxically, only at
the discrete values of $\lambda$.
Appendix B: Conditions of validity for the theorem
of Pawlowski et al. 

Reference [19] proves that a family of hidden variable
theories requires one of the parties to have information
about both the remote setting and the remote outcome.
In the following, we clarify the assumptions actually
made in Ref. [19], and show that the models discussed in
the present manuscript do not comply with these assumptions,
so that there is no contradiction. First of all, we
notice that two hypotheses are made explicitly: "freedom
of choice" and "realism". The second hypothesis is but
counterfactual-definiteness, i.e., the existence of a master
probability $P(A_0, A_1, B_0, B_1|\mathbf{a}_0, \mathbf{a}_1, \mathbf{b}_0, \mathbf{b}_1)$ such that the
observed probability $P(A_j, B_k|\mathbf{a}_j, \mathbf{b}_k)$ for any two settings
$\mathbf{a}_j, \mathbf{b}_k$ is its marginal. It is well known [14, 21] that the
hypothesis of counterfactual-definiteness alone is sufficient
in order to derive Bell-type inequalities. Thus, Ref. [19]
is not actually using the hypothesis of "realism", or the
models considered could not possibly violate the CHSH
inequality. This leaves only the hypothesis of "freedom
of choice", which is akin to what in the present paper
is referred to as "Uncorrelated Choice" (or Measurement-Independence).
There is a difference, however, in that
the "freedom of choice" used in Ref. [19] refers to freedom
only within two possible choices.$^3$ Furthermore, in
addition to this hypothesis, Ref. [19] makes other assumptions
that are scattered through the text and not stated as
hypotheses, and the conclusions are not restricted to the
models satisfying said assumptions. Let us enunciate all
the hypotheses actually made:

1. A and B are limited to two choices each, $\mathbf{a}_j$, $\mathbf{b}_k$,
with $j, k = 0, 1$.

2. Within this restriction, the choices are not influenced
by the hidden parameters, and vice versa, so
that $p(j, k|\lambda) = p(j, k) = 1/4$ and $\mu(\lambda|j, k) = \mu(\lambda)$.

3. A and B are mimicking the results of a measurement;
they are not actually performing one. To this
end they share some information $\lambda$.

4. B gives an output $\tau$ according to an algorithm that
provides a number, $0 \le P_+(\lambda, k) \le 1$: if a random
number between 0 and 1 is larger than $P_+(\lambda, k)$, B
will output $\tau = -1$, otherwise $\tau = +1$.

5. A receives some information $X$ from B, in addition
to $\lambda$, and tries to mimic the conditional probability
$P_{\sigma}(\lambda, \mathbf{a}, \mathbf{b}, \tau)$ by an algorithm providing a threshold.
Reference [19] demonstrates that under hypotheses
(1)-(5), if the CHSH inequality is violated then
$\max_k\{\mathrm{Prob}(k|\lambda, X)\} > 1/2$ and $\max_\tau\{\mathrm{Prob}(\tau|\lambda, X)\} > 1/2$,
at least for some $\lambda, X$. Hypothesis (1) is crucial: in
the Toner and Bacon model, the information $X = c$ does
not allow one to extract any information about the remote
outcome, if $\mathbf{a}, \mathbf{b}$ can vary over the whole unit sphere. Nevertheless,
the models presented herein are valid for any
distribution of the settings, and they can be restricted
to two binary choices of polarizations. Hence, the reason
for the apparent discrepancy does not reside in hypothesis (1).
The key, instead, is hypothesis (5): the
observer at A is not measuring a physical property of
a system, but is calculating a number through an algorithm,
which receives $\lambda, X$ as an input, and mimicking
a conditional probability accordingly. By contrast, let
us see how A would estimate the conditional probability
from the experimental data if a measurement was actually
performed. First, A and B make a large number of
measurements. They disclose the settings $\mathbf{a}, \mathbf{b}$ and the
outcomes $\sigma, \tau$ that they used and observed in each individual
trial. Then A selects the data for which, say,
$\mathbf{b} = \mathbf{b}_0$ and $\tau = \tau_0$, and estimates the conditional probability
of obtaining $\sigma$ with the frequency that was observed
in this subset of data. Thus, both setting and outcome
information must be sent to A in order to extract the
conditional probabilities. The results of Ref. [19] put
some restrictions on the models that try to reproduce
the conditional probability through an algorithm, but do
not affect models, like the ones introduced in our paper,
that assume the existence of additional parameters
$\lambda$ giving a finer description of a physical system. In this
case, the outcome and setting information needs to be
sent only after the measurements have been performed,
by the very definition of conditional probability.
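The post-selection procedure described above can be illustrated with simulated data. The Python sketch below is a toy model under explicit assumptions: outcomes are drawn directly from the singlet statistics $\frac{1}{4}[1 - \sigma\tau\,\mathbf{a}\cdot\mathbf{b}]$ (the hidden variables are already averaged out), each party picks one of two hypothetical settings at random, and the function names are illustrative. A's estimate uses only the disclosed record of settings and outcomes.

```python
import numpy as np

rng = np.random.default_rng(3)

def simulate_trials(a_settings, b_settings, n):
    """Draw random setting labels and singlet outcomes for n trials."""
    j = rng.integers(len(a_settings), size=n)       # A's setting label
    k = rng.integers(len(b_settings), size=n)       # B's setting label
    x = np.einsum("ij,ij->i", a_settings[j], b_settings[k])   # a.b per trial
    sigma = rng.choice([-1, 1], size=n)              # trivial marginal at A
    # For the singlet, P(tau = -sigma | sigma) = (1 + a.b)/2.
    opposite = rng.random(n) < 0.5 * (1.0 + x)
    tau = np.where(opposite, -sigma, sigma)
    return j, k, sigma, tau

def estimate_conditional(j, k, sigma, tau, j0, k0, tau0):
    """Frequency of sigma = +1 in the subset with settings (j0, k0) and remote outcome tau0."""
    subset = (j == j0) & (k == k0) & (tau == tau0)
    return np.mean(sigma[subset] == 1)

# Hypothetical pairs of settings.
a_settings = np.array([[0.0, 0.0, 1.0], [1.0, 0.0, 0.0]])
b_settings = np.array([[1.0, 0.0, 1.0], [1.0, 0.0, -1.0]]) / np.sqrt(2.0)

j, k, sigma, tau = simulate_trials(a_settings, b_settings, 200_000)
estimate = estimate_conditional(j, k, sigma, tau, j0=0, k0=0, tau0=1)
expected = 0.5 * (1.0 - a_settings[0] @ b_settings[0])   # P(sigma=+1 | tau=+1)
```

The selection step needs both the remote setting label `k` and the remote outcome `tau`, but only after all trials are complete, which is the point made in the text.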